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DELIVERY OF MULTIMEDIA DESCRIPTIONS USING ACCESS UNITS 



RELATED APPLICATIONS 

[0001] This application claims the benefit of U.S. Provisional Application Nos. 

60/272 ,729, filed March 1, 2001, 60/304,864 filed July 11, 2001, 60/260,949, filed 
January 10, 2001, and 60/242,378, filed October 20, 2000, the above applications herein 
incorporated by reference. 
FIELD OF THE INVENTION 

[0002] This invention relates generally to transmitting data. More specifically, the 
invention relates to mechanisms and techniques that enable data related to multimedia 
content descriptions to be encoded and transmitted. 
NOTICE OF COPYRIGHT 

[0003] A portion of the disclosure of this patent document contains material that is 
subject to copyright protection. The copyright owner has no objection to the facsimile 
reproduction by anyone of the patent document or the patent disclosure as it appears in 
the Patent and Trademark Office patent file or records, but otherwise reserves all 
copyright rights whatsoever. The following notice applies to the software and data as 
described below and in the drawings hereto: Copyright © 2000, Sony Electronics, Inc., 
All Rights Reserved. 
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BACKGROUND OF THE INVENTION 

[0004] Digital multimedia information is becoming widely distributed though 
broadcast transmission, such as digital television signals, and interactive transmission, 
such as the Internet. The information may be in still images, audio feeds, or video data 
streams. The Moving Picture Experts Group (MPEG) has promulgated a Multimedia 
Content Description Interface, commonly referred to as MPEG-7, to standardize the 
description of multimedia information when it is transmitted from a system that generates 
the information to a system that uses the information. 

[0005] MPEG-7 defines a generic structure for describing multimedia content. For 
example, the structure for a standard movie could include scenes, shots within scenes, 
titles for scenes, and time, color, shape, motion, and audio feature information for shots. 
The corresponding description would contain a "descriptor" component that describes 
features of the content, such as color, shape, motion, frequency, title, etc., and a 
"description scheme" component that describe relationships among two or more 
descriptors, e.g., a shot description scheme that relates together the features of a shot. For 
example, Fourier descriptors and polygon vertices are representations, or descriptors for a 
shape feature. A description scheme can also describe the relationship among other 
description schemes and between description schemes and descriptors, e.g., a scene 
description scheme that relates the different shots in a scene and relates the title of the 
scene to the shots. 

[0006] The structure and format of an MPEG-7 content description is defined by a 
schema using a Description Definition Language (DDL), which is designed to define 
descriptors and description schemes. For each descriptor, the schema specifies the 
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syntax and semantics of the corresponding feature. For each description scheme, the 
schema specifies the structure and semantics of the relationships among its children 
components, which are descriptors and description schemes. The DDL for MPEG-7 
multimedia content is based on the XML (extensible markup language) and the XML 
Schema standards. The descriptors, description schemes, semantics, syntax, and 
structures of the content description are represented with XML elements and XML 
attributes. Some of the XML elements and attributes may be optional. 
[0007] A multimedia content description is encoded in an XML "instance document" 
that references the appropriate schema and is an instance of the schema defined in DDL; 
that is, it contains data that adheres to the syntax and semantics defined in the DDL 
schema. The instance document contains a set of "descriptor values" for the required 
elements and attributes in the schema and for any necessary optional elements and/or 
attributes. An instance document is transmitted from a system having or generating the 
description data, through a communication channel such as a computer network, to 
another system that will a consume the multimedia content description data contained in 
the instance document. 

[0008] When transmitting a multimedia description represented in XML it possible to 
send the data in either textual form or to encode the XML data into another binary form, 
such as the binary format specified for MPEG-7 data, known as "BiM". 
[0009] Figure 1 illustrates one example of transmitting a multimedia document. In 
system 1 10, content description 1 12 is a multimedia description, for example a 
description of a movie. Encoder 114 encodes content description 1 12 in a format suitable 
for transmission. After encoding, encoded data stream 1 16 represents the encoded form 
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used for transmitting content description 1 12 to decoder 118 over some communication 
channel (not shown). Decoder 118 receives encoded data stream 1 16 and uses it to 
reconstruct content description 120, which is the same as content description 112. 
Encoder 114 may use various methods to form the encoded data used to transmit content 
description 112. 

[0010] While various means may be used by encoder 1 14 to transmit encoded data 
streaml 16 to decoder 118, normally the description data must be sent as a encoded data 
stream over a communication channel, such as a network communication channel using a 
communications protocol like TCP/IP. Although compression can reduce transmission 
time by decreasing the size of the encoded stream, if the description is large, transmitting 
the entire description over a network can still take too much time. Rather than sending 
the entire description, the encoder may send parts of the description to the encoder. The 
method for determining which parts to send is not the subject of this invention and is 
application dependent. Herein, the term "description fragment" means a part of a 
description. 

[0011] The content description 112 may be either static or dynamic. A static 
description is one that does not change during the duration of communication between 
encoder 1 14 and decoder 1 18. A dynamic description is one in which the description data 
1 12 is changed during the duration of the communication between encoder 1 14 and 
decoder 118. This change must be reflected in the description data 120. Description data 
1 12 is changed when data pertaining to the multimedia content is added, deleted, or 
changed. For example, if the description pertains to a scene being captured by a camera 
the description may change if an object appears or disappears from the scene. 
Additionally, if the description is about a television program, portions of the data may be 
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updated, e.g. the broadcast time of the program may changes, and such changes sent from 
the encoder to decoder. When the description is dynamic it is more efficient to send only 
updates to the encoder rather the resending the entire changed description. 
[0012] Furthermore, in a distributed environment, description data is spread across 
multiple hosts and each host contains only a part of a complete description. If all 
description fragments (i.e., parts of the description) are streamed from a single encoder, 
then the encoder spends a significant amount of time and resources in gathering all the 
description fragments. 
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SUMMARY OF THE INVENTION 

[0013] One embodiment is a method comprising the following process: form an 
access unit comprising a fragment update, where the fragment update comprises a 
fragment update command; form an encoded data stream from the access unit. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

[0014] Figure 1 is prior art and illustrates an example of transmitting a description 
from an encoder to a decoder. 

[0015] Figure 2 illustrates an example of a content description tree and its fragments. 
[0016] Figure 3 illustrates an example of an access unit. 
[0017] Figure 4 illustrates an example of a fragment update. 

[0018] Figure 5a is a flow diagram showing blocks, which when implemented by an 
encoder send an encoded data stream to decoder 510. 

[0019] Figure 5b is a flow diagram showing blocks, which when implemented by a 
decoder provide instance document 1 16 with both text and binary encoding. 
[0020] Figure 6 is a flow diagram showing one embodiment of the changes made to 
BiM syntax. 

[0021] Figure 7 illustrates one example of BiM syntax. 

[0022] Figure 8 illustrates an example of a distributed MPEG-7 description. 

[0023] Figure 9 illustrates one example of a fragment reference. 

[0024] Figure 10 illustrates one example of internal and external pointers. 

[0025] Figure 1 1 illustrates one example of a computer system that implements the 

principles of the invention. 
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DETAILED DESCRIPTION 

[0026] In the following description, numerous specific details such as specific 
materials, processing parameters, processing blocks, etc., are set forth in order to provide 
a though understanding of the invention. One skilled in the art will recognize that these 
details need not be specifically adhered to in order to practice the claimed invention. In 
other instances, well known processing blocks, materials, etc., are not set forth in order 
not to obscure the invention. Furthermore, different embodiments of this invention may 
use different syntaxes for encoding substantively the same information. For example, the 
use of different names, attributes, elements to form alternative embodiments of this 
invention will be will be obvious to one well-versed in the art. 

[0027] This invention provides an efficient method to deliver MPEG-7 description 
data from an encoder to a decoder using access units, a method for configuring a system 
to receive access units, and a method for using access units for communication of 
distributed multimedia content description. Furthermore, the invention is not limited to 
MPEG-7 multimedia description and may also be applied to any structured data 
represented using XML. 

[0028] When sending a description from a decoder to an encoder, the description is 
usually divided into pieces that are sent to a decoder one after another. In the following, 
"access unit" is used to refer to the smallest unit for transmitting a part of a description 
from an encoder to a decoder across a communication channel or inside a stream. The 
term is chosen because the access unit is the smallest unit for a decoder accessing a 
description. When transmitting a description the encoder divides the description into 
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fragments, encodes these as access units, and sends access units to the decoder where 
they are used to reconstruct the description. 

[0029] The description tree, defined in detail later, is a representation of hierarchical 
structure of elements, attributes, text values and so on, that make up an instance 
document. A "fragment" is an arbitrarily connected component of the description tree that 
is not necessarily complete with respect to a schema definition, such as MPEG-7 DDL. In 
other words, a fragment is a sub-tree of the description tree. 

[0030] An encoded transmission format typically provides information pertaining to 
the method of encoding transmitted data. For example, encoder 1 14 sends information 
about the encoded data stream 1 16 to the decoder. Such information may be sent over the 
same communication channel or stream (e.g. "in-band") or "out-of-band"; that is, on a 
separate communication channel from that used for sending the encoded description 
stream. An encoder provides information about the resources needed for decoding so that 
a decoder can correctly decode an instance document. Information about resources 
needed for decoding allows a system to determine if the needed resources are present and 
available, and if not then to send a command to stop transmission. One method of 
providing resource information is to send a global element with information (e.g., 
MPEG7 JDecoderConfig) at the beginning of a session. 

[0031] When the description data is transmitted or streamed, for example when a 
description is broadcast, a decoder needs to determine a correct beginning point within 
encoded data stream. To do so the encoded data typically has synchronization points. 
Synchronization points are points in the encoded data stream where a decoder can form 
an initial state, start decoding, and then properly decode subsequent encoded data. 
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[0032] Synchronization points enable random access to the encoded stream of 
description data. This is analogous to the "I-Frame" used in video compression standard, 
such as the MPEG-1 and MPEG-2 standards. Such access points are associated with a 
partial description, or sub-tree, where a decoder can access only a part of the entire 
description. Such sub-trees must be semantically meaningful points. That is, they must be 
sub-tree that is an instance of a descriptor, description scheme, or datatype defined by the 
MPEG-7 standard schema. Such semantically meaningful points may be defined as 
MPEG7Unit. 

[0033] Another example of a synchronization point is a reset point that contains a 
fragment that forms a complete description with respect to the schema, for example 
MPEG7Main, a complete, independent, descriptive entity. 

[0034] In some cases a multimedia content description is distributed over multiple 
servers. In one embodiment, in order to support rapid and efficient transmission of 
multimedia work stored on multiple servers the encoder transmits the location of 
description fragments instead of transmitting the description fragments themselves to the 
decoder. The decoder can then directly retrieve the description fragments from other 
servers. 

[0035] Figure 2 illustrates an example of a content description tree and its fragments. 
Content description tree 200 represents a multimedia content description and is part of 
the data in instance document 112. The structure of the description is determined from 
the hierarchical structure of the XML elements and attributes used to represent the 
multimedia description. The details of this structure are specified in the DDL schema for 
the description. As shown in Figure 2, a description can be divided into a set of fragments 
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that can be "glued" back together to reconstruct the original description tree. For 
example, encoder 1 14 divides content description tree 200 into a set of fragments 202. 
Fragments 202 1-3 are subtrees of content description tree 200. 
[0036] One method for transmitting a description is for the decoder to send it as a 
series of fragments, which are transmitted and received by the decoder, which then 
recombines them to reconstruct the complete description. Because each fragment 202 is 
sent separately, there must be information provided for each fragment 202 that identifies 
its location within content description tree 200. Fragments 202- 2 and 3 have information 
relating their position to fragment 202-1. Consequently, even though fragments 202 are 
sent separately, a corresponding decoder can reassemble fragments 202 into content 
description tree 200. The encoder does so by sending the fragment with its associated 
information as an access unit. 

[0037] Figure 3 is an example of an access unit. Henceforth, the abbreviation AU 
refers to an "access unit". Access unit 300 is composed of an optional digital signature 
302 and one or more fragment updates 304. One embodiment of access unit 300 is an 
MPEG-7 system datatype to which timing information is attached. Access units typically 
do not carry the same timing information. The timing information is used for, in one 
example, to synchronize a transmitter (not shown) and a receiver (not shown) or to 
coordinate the delivery of the description with its associated multimedia content. In 
many communication protocols, the packets in the transport layer packets carry timing 
information, such as delivery times. 

[0038] Figure 4 is one example of a fragment update. Fragment update 400 is 
composed of three parts. Navigation part 402 points to a location in content description 
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tree 200 where update command 404 should execute. In other words, it specifies how to 
navigate to the node in the description where the command is to be applied. Update 
command 304 specifies the type of update to execute, for example to add, delete, or 
replace a fragment. Payload 406 provides the value of fragment 202 to be added or 
replaced but is not present when a delete command is sent. 

[0039] The inclusion of an update command inside of an access unit allows an 
encoder to send either static or dynamic descriptions. When the description being 
transmitted is static, the encoder can divide the description into fragments, which will 
typically consists of add commands to build up the description. When the description is 
dynamic, an encoder can send a series one or more access units that contain update 
commands to effect any changes on the source description at the destination decoder. 
[0040] Figure 5a is a flow diagram of blocks, which when implemented by an 
encoder and a decoder encode an access unit. In block 502 the decoder calculate the 
description tree fragments that need to be sent or updated. The process for doing this is 
encoder and application dependent and is not discussed here. In block 504 the decoder 
determines the navigation paths for the updated description. These are the paths to the 
root nodes of the description sub-trees that are to be updated. In block 506 the encoder 
gathers together all of the information needed to send the update and fragment in an 
access unit. For example, the decoder may represent the information in an access using an 
XML syntax discussed later or in a binary encoding. In block 508 the encoder sends the 
access unit information to the decoder in an encoded data stream. 
[0041] Figure 5b is a flow diagram of blocks, which when implemented by an 
encoder and a decoder decode an access unit. The decoder performs the processing 
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depicted in Figure 5b. In block 512 the decoder receives an access unit from the encoded 
data stream. In block 514 the information in the AU is decoded. For example, this may 
involve decoding a binary form of the access unit or parsing a textual XML 
representation of the access unit. In block 516, the navigation part is used to determine 
the affected node in the current description maintained by the decoder. In MPEG-7, for 
example, the navigation is expressed as an XPath expression that locates the root of the 
sub-tree that the decoder will update. 

[0042] In block 518, complete the access units. Each access unit has a set of update 
commands and an indication of whether or not the access unit is applied serially or in 
parallel. In this block the update command is examined in order to determine which 
operation is specified: an add, delete, or replace command. If it is a delete command the 
sub-tree is deleted by the decoder in block 520. In block 522 the fragment in the update 
command is added to a designated node. In block 524 the designated fragment is 
replaced by the fragment value in the update command. 

[0043] Figure 6 is a flow diagram showing one embodiment of block 516 when the 
navigation path is represented as an Xpath expression (or equivalent path representation). 
Determine if there is a 7\ block 600. If there is no V then absolute addressing is not 
used, block 602. 

[0044] Determine if the V is followed by a root node, block 604. If the V is 
followed by a root node then conform to mode '00' of BiM navigation command, block 
606. 
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[0045] Determine if the 7' is followed by a top level node, step 608. If the 7' is 
followed by a top level node then conform to mode '01' of BiM navigation command, 
block 610. 

[0046] Determine if there is a V, block 612. If there is a V Then context node is 
indicated, block 614. 

[0047] Determine if there is a block 616. If there is a then parent of a context 
node is indicated, block 618. 

[0048] Determine if 7' is preceded by V, block 620. If 7' is preceded by then 
conform to relative addressing mode '11' of BiM navigation command, block 622. 
[0049] In order to decode access units, a decoder needs an indication of where in an 
encoded data stream to begin decoding. Synchronization points are one example of 
points at which a decoder begins decoding and produces correct output. In one 
embodiment, a synchronization point is a random access point associated with a 
description fragment where a decoder can decode a part of the description and obtain a 
semantically correct portion of the description. As explain earlier, a semantically correct 
fragment is one that corresponds to a descriptor, description scheme, or datatype defined 
in the MPEG-7 schema. 

[0050] One embodiment of a synchronization point is a reset. In one embodiment, a 
reset point is implemented using a "reset" update command. The access unit containing a 
reset command contains a fragment, or fragments, that form a complete description tree 
with respect to the schema. Such an access unit serves as a reset point for a decoder as it 
resets the state of the description decoding process. One example of a reset point is 
content description tree 200 in Figure 2. 
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[0051] Yet another embodiment of a synchronization point is a fragment. Examples 
of fragments include fragments 202 1-3. A fragment is an arbitrarily connected 
component of the description tree that does not need to be complete with respect to any 
XML schema. 

[0052] In the encoded data stream, a special sequence can be used to indicate the 
occurrence of a synchronization point. Figure 7 illustrates one example of a 
synchronization code that is possible for MPEG-7 BiM syntax. The twenty-four bit 
pattern 700, followed by a three bit pattern 702 indicates that a synchronization point is 
present. Three-bit pattern 702 indicates MPEG7Unit or MPEG7Main. In order to 
indicate a fragment, modify three bit pattern 702 to reflect signaling of a fragment. 
[0053] In one embodiment, signal synchronization points at the AU level by defining 
an attribute for MPEG7_AccessUnit called "sync_point," enumerating the three 
possibilities. 

[0054] Figure 8 illustrates a distributed MPEG-7 description. Content description 
tree 800 is distributed among hosts 802. Host 802-1 stores fragment 804-1, host 802-2 
stores fragment 804-2, host 802-3 stores fragment 804-3, and host 802-4 stores fragment 
804-4. In the prior art, in order to transmit content description tree 800, all fragments are 
transmitted from a single encoder. 

[0055] Rather than transmit all the fragments values in an access unit, a single host, 
for example host 802-3 with description data in fragment 804-3, transmits fragment 804- 
3 with pointers, or fragment references, to audio description data in fragment 804-1 and 
video description data in fragment 804-2. 
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[0056] Figure 9 illustrates one possible implementation of a fragment reference with 
access units, which is an extension and modification of the implementation of access 
units illustrated in Figure 3. Access unit 900 contains fragment update 902. Rather than a 
payload in fragment update 902, as would be found in the prior art, the invention shows 
fragment reference 904. Fragment reference 904 is a reference that identifies an 
accessible resource containing the fragment. 

[0057] In one embodiment, any resource designated by a uniform resource identifier 
(URI) can be available as the source for a fragment value. For example, a URI of the 
form "http://. . indicates Hypertext transfer Protocol (HTTP) form of a URI and points 
to referenced fragment 906. In one example, the source is a node identified in some 
MPEG-7 description, though the source is not necessarily in the same description 
document as the description document being transmitted. 

[0058] In another embodiment, the reference is to a previously sent node that is used 
as a fragment value. This allows transmission of identical subtrees. The reference is 
typically in XPath. 

[0059] In another embodiment the reference is to a node in the current description 
tree. In one example the reference uses an XML EDREF. 
[0060] Any resource that represents an MPEG-7 description whose type is 
appropriate for the fragment value can be referenced. In one embodiment the referenced 
fragment comes from the same description tree as the one being transmitted. In another 
embodiment the referenced fragment comes from a different description tree as the one 
being transmitted. Fragment reference 904 preferably resolves to a node that is the root 
of a description reference. 
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[0061] There are several embodiments in which a decoder (not shown) accesses the 
referenced fragment. In one embodiment the decoder receives the referenced fragment 
through an MPEG-7 transmission system whereby the fragment is transmitted as a partial 
description using access units. 

[0062] In another embodiment the decoder stores fragment values. The decoder 
compares referenced fragments to stored fragments and if a referenced fragment is stored 
then the decoder obtains the fragment from storage; otherwise, the decoder retrieves the 
fragment from the referenced location. 

[0063] In another embodiment the decoder uses an HTML address to access an XML 
document from a network (not shown). 

[0064] In another embodiment the decoder accesses fragment references only when 
needed, rather than when first received. Such an embodiment allows the decoder to have 
greater control over the timing and reception of description data. For example, if the 
decoder does not need a particular fragment reference, it may never access and retrieve 
that fragment reference. 

[0065] A fragment that is referenced may have links that point to nodes outside of the 
set of nodes inside the fragment's sub-tree. When a fragment is referenced the decoder 
must determine how such links are to be correctly resolved. 

[0066] Internal links point to nodes inside the same document as the fragment. An 
internal link can lose its meaning if the fragment is removed from the original document. 
An external link is points to a node in another document. One example of external links 
are links implemented in XML by a URI reference. External pointers within a fragment 
do not change their meaning if they are removed from the original document. 
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[0067] Figure 10 illustrates one example of internal and external links. Doc 1000 has 
node 1002 with an internal pointer to node 1004. Nodes 1002 and 1004 have identifiers 
unique to doc 1000, for example nodes "a" and "b", respectively. Node 1002 references 
node identified as "b", which is node 1004. 

[0068] Suppose that a decoder references a fragment sub-tree rooted at node 1002 in 
Doc 1 and another fragment 1008 in Doc 2. Further suppose that a the sub-tree rooted at 
node 1012 in Doc 1010 is to be replaced with a fragment that is referenced, or a 
referenced fragment. 

[0069] If the update for fragment 1002 is applied to document 1010 then the result is 
shown as document 1016. Note that in the updated sub-tree 1018 the internal link shown 
at node 1020 points to the node B in document 3 rather than the original target of the 
internal link, which was node 1004 in document 1. 

[0070] In another embodiment, when fragment 1008 is used to update node 1012 of 
document 1010 the result is illustrated as document 1022. The updated sub-tree at node 
1024 still points to the node B in document 1010 as in the original document 1006. 
[0071] In another embodiment, node 1002 is translated into an external link to node 
1004 (node 'B' of docl). 

[0072] In yet another embodiment, within a fragment update an encoder encodes a 
fragment reference with a binary encoding format and a command with a text-based 
encoding format. 

[0073] In yet another embodiment, a fragment update comprises both a payload and a 
fragment reference. 
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[0074] Referencing fragments makes, in one embodiment, distributing MPEG-7 
description data at the system level transparent to the description itself. Fragment 
references enable distribution of large descriptions across multiple servers. 
[0075] Referencing fragments also allows sharing and replicating description 
references that are used in multiple descriptions. 

[0076] Transmission size is compressed if there are common references within a 
description tree. An encoder transmits a reference once and then sends a pointer to that 
reference in subsequent access units. 

[0077] In one embodiment the DDL definition for decoder configuration information, 
named DecoderConfig here is: 

<element name= "DecoderConfig" type= "mpeg7:DecoderConfigType'7> 
<complexType name= "DecoderConfigType"> 
<choice> 

<element name= "Predefined"> 
<simpleType> 

restriction base= "positiveInteger"> 
<minlnclusive value= "l'V> 
<maxlnclusive value= "255'7> 
</restriction> 
</simpleType> 
</element> 

<element name= "SchemaReference" type= "uriReference" minOccurs= 
"1" maxOccurs= "unbounded"/> 
</choice> 

<element ref= "MPEG7_AccessUnit" minOccurs= "0" maxOccurs= "1'7> 
<elementref= "Profile" minOccurs= "0" maxOccurs= 'T7> 
</complexType> 

Predefined indicates whether or not to use a predefined configuration. In one 
embodiment a list of uniform resource identifiers (URIs) follows. In another 
embodiment the number of URIs is zero and a well-known set of schemas is used instead. 
SchemaReference is an URI to reference the schema. AccessUnit provides the 'reset' 
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and initial state description, supplied inline, that forms the basis of the reset and initial 



states. 

[0078] In one embodiment, the DDL representation definition for an access unit, 
called MPEG7_AccessUnit here, is: 

<complexType name=="MPEG7_AccessUnit"> 

<element name ="update" type="updateCommand" minOccurs=="l" 

maxOccurs="unbounded'7> 
<attribute name="executionOrder"> 

<simpleType base="string"> 

<enumeration value=" sequential"^ 
<enumeration value="parallel"/> 
</simpleType> 
</attribute> 
</complexType> 

<complexType name="updateCommand"> 

<element name="command" type="updateType"/> 
<element name="location" type="updateLocation"/> 
<element name= 'value" type="updateValue"/> 

</complexType/> 

<simpleType name="updateType" base="string"> 

<enumeration value="add"/> 

<enumeration value="delete'7> 

<enumeration value="change"/> 

<enumeration value="reset"/> 
</simpleType> 

<complexType name="updateLocation"> 

<attribute name="fragmentID" type="uriReference"/> 

<attribute name="position" type=' boolean" use="default" value="l"/> 

</complexType> 
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<complexType name="updateValue" minOccurs="0" maxOccurs=T'> 
<choice> 

<element name="binaryValue" type="binaryUpdate'7> 
<element name="textualValue" type="textualUpdate'7> 
<elementref resetContentDescription"/> 
</choice> 

<attxibute name="schemalndex" type="integer'7> 
</complexType> 

<simpleType name="listOfByte" base="Byte" derivedBy="list"> 

<complexType name="VectorB" base="listOfByte" derivedBy="extension"> 

<attribute ref ="mpeg7:dim'7> 
</complexType> 

<complexType name="binaryValue"> 

<element name="value" type="VectorB'7> 

<! Using "NOTATION" will point to which decoder to use for nonXML 
content. 

and it is optional> 
attribute name="binaryType" type="NOTATION" use="optional'7> 

</complexType> 

<complexType name="textualValue"> 

<any namespace= "##other" minOccurs^'l" maxOccurs="unbounded" 

processContent=" skip"^ 
</complexType 

UpdateType indicates the type of update command used, for example add, delete, change 
and reset. In one embodiment updateType is update command 404. In one embodiment 
the reset command replaces the whole content description tree with a predefined or 
referenced content description tree. updateLocation uses a subset of the Xpath langage to 
indicate where the update command executes. In one embodiment updateLocation is 
navigation part 402. In one embodiment there is a flag indicating whether an addition 
occurs before or after the identified location. updateValue indicates the value to be added 
or replaced. In one embodiment updateValue is payload 406. In one embodiment the 
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update value is specified either as a binary or textual value using the binary Value or 
textualValue datatypes respectively. 

[0079] In another embodiment, DDL representation syntax for a textually-encoded 
access unit is: 

<element name= "AccessUnit type="mpeg7:AccessUnitType/> 
<complexType name= "AccessUnitType"> 
<sequence> 

<element name= "FragmentRef ' type= "mpeg7:FragmentUpdateType" 
minOccurs= "0" maxOccurs= "unbounded'7> 
</sequence> 

<attribute name= "Order" use="optional'7> 
<simpleType> 

restriction base= "string"> 

<enumeration value= "seq'7> 
<enumeration value= "par'7> 
</restriction> 
</SimpleType> 
</attribute> 

<attribute name= "auID" type= "NcName"> 
</complexType> 

In one embodiment a fragment reference associates the sub-tree included in the same AU 
with its child sub-tree included in another AU. In another embodiment a fragment update 
is used for dynamic building up a content description tree. In one embodiment order is 
used to signal as to whether the update order should be preserved between a sequence of 
FragmentUpdates. 

[0080] In another embodiment, DDL representation syntax for the syntax for 

accessing units allows either fragments or fragment references as follows: 

<complexType name="FragmentPayloadType"> 
<complexContent> 
j<choice> 

<element name="Payload M type='mpeg7:Mpeg7RootType'7> 
<element name="Reference" type="mpeg7:ReferenceType7> 
</choice> 

<attribute name="PartialInstantiation" type="boolean" 
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use="default" value="false'7> 

</complexContent> 
</complexType> 

[0081] Payload contains the value of a description fragment to be updated. 
Reference contains a URI reference to fragment that can be used as the source for the 
value of the update. Partiallnstantiation is an attribute used to signal that one or several 
elements of the fragment have empty content. Payload contains an MPEG-7 root unit, 
which is either a complete description or a single descriptor or description instance. 
[0082] In one embodiment, ConnectionPoint indicates a node of the sub-tree to which 
a child sub-tree should be connected. In one embodiment FragmentLocation indicates a 
child sub-tree included in another AU. 

[0083] Figure 1 1 illustrates an embodiment of computer system 1 100 that implements 
the principles of the invention. System 1 100 includes a stand-alone or portable 
computing device. Computer system comprises processor 1 102, storage device 1 104, and 
MPEG encoder 1 106, all coupled together with interconnect 1 108, such as a bus or a 
point-to-point pointer. In addition, a number of user input/output devices (not shown) 
couple to system 1 100. 

[0084] Processor 1 102 represents a central processing unit of any type of architecture 
(e.g., Intel architecture or Sun Microsystems architecture), or hybrid architecture. In 
addition, processor 1 102 could be implemented in one or more chips. Storage device 
1 104 represents one or more mechanisms for storing data such as content description 
trees or fragments. Storage device 1104 represents read-only memory (ROM), random 
access memory (RAM), magnetic disk storage media, optical storage media, flash 
memory devices, and/or other machine-readable media. Interconnect 1 108 represents one 
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or more buses (e.g., accelerated graphics port bus, peripheral component interconnect 
bus, industry standard architecture bus, X-Bus, video electronics standards association 
related to buses, etc.) and bridges (also termed bus controllers). 

[0085] While this embodiment is described in relation to a single processor computer 
system, the invention could be implemented in a multi-processor computer system. In 
addition to other devices, one or more networks 1110 may be present. Network 1110 
represents one or more network connections for transmitting data over a machine- 
readable media. The invention could also be implemented on multiple computers 
connected via such a network. 

[0086] Storage device 1 104 has stored therein data 1 1 12 and program instructions 
1 1 14 (e.g., software, computer programs, etc.). Data 1 1 12 represents data stored in one 
or more of the formats described herein. Program instructions 1 1 14 represent the 
necessary code for performing any and/or all of the techniques described with reference 
to Figures 2-10. It will be recognized by one of ordinary skill in the art that storage 
device 1104 preferably contains additional software (not shown), which is not necessary 
to understanding the invention. 

[0087] System 1 100 also includes encoder 1 106 coupled to processor 1 102 and 
storage device 1 104. Encoder 1 106 is not necessarily based in hardware, it may also be 
stored as instructions 1 1 14 in storage device 1 104 and executed by processor 1 102. 
Encoder 1 106 is used for encoding control signals and/or microcode entry points sent by 
processor 1 102. In response to these control signals and/or microcode entry points, 
encoder 1106 encodes the control signals so that they may be transmitted across network 
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1 1 10 to system 1116 where processor 1 1 18 and decoder 1 120 will perform the 
appropriate operations. 

[0088] The particular methods of the invention have been described in terms of 
computer software with reference to a series of flow diagrams. The methods to be 
performed by a computer constitute computer programs made up of computer-executable 
instructions illustrated as blocks (acts). Describing the methods by reference to a flow 
diagram enables one skilled in the art to develop such programs including such 
instructions to carry out the methods on suitably configured computers (the processing 
unit of the computer executing the instructions from computer-readable media). The 
computer-executable instructions may be written in a computer programming language or 
may be embodied in firmware logic. If written in a programming language conforming 
to a recognized standard, such instructions can be executed on a variety of hardware 
platforms and for interface to a variety of operating systems. In addition, the present 
invention is not described with reference to any particular programming language. It will 
be appreciated that a variety of programming languages may be used to implement the 
teachings of the invention as described herein. Furthermore, it is common in the art to 
speak of software, in one form or another (e.g., program, procedure, process, application, 
module, logic...), as taking an action or causing a result. Such expressions are merely a 
shorthand way of saying that execution of the software by a computer causes the 
processor of the computer to perform an action or a produce a result. It will be 
appreciated that more or fewer processes may be incorporated into the methods as 
described above without departing from the scope of the invention, and that no particular 
order is implied by the arrangement of blocks shown and described herein. 
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[0089] In the foregoing specification, the invention is described with reference to 
specific embodiments thereof. It will, however, be evident that various modifications and 
changes may be made thereto without departing from the broader spirit and scope of the 
invention as set forth in the appended claims. The specification and drawings are, 
accordingly, to be regarded in an illustrative rather than a restrictive sense. 
[0090] It is also important to note that while the embodiments described here have 
referred to MPEG-7 descriptions, the principles of this invention apply to delivering any 
XML document from an encoder to a decoder over a communication channel. 
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